Data Quality in the Secondary Care Medicines Data (SCMD)

This notebook runs basic quality checks on the SCMD data. In particular, we check (1) the completeness of each field, (2) which unique products are missing Defined Daily Dose (DDD) values, and (3) how many hospitals appear each month.

Running this notebook requires ProfileReport from pandas_profiling. The documentation is provided here.

Completeness of Each Field
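
A minimal sketch of a per-field completeness check, written with plain pandas rather than ProfileReport. The column names (`year_month`, `ods_code`, `ddd`) and the sample rows are hypothetical placeholders, not the actual SCMD schema; completeness is taken here to mean the percentage of non-null values in a column.

```python
import pandas as pd


def field_completeness(df: pd.DataFrame) -> pd.Series:
    """Return the percentage of non-null values in each column."""
    return df.notna().mean().mul(100).round(1)


# Hypothetical sample rows; the real notebook would load the SCMD extract instead.
sample = pd.DataFrame(
    {
        "year_month": ["2021-01", "2021-01", "2021-02", "2021-02"],
        "ods_code": ["RX1", "RX2", "RX1", None],
        "ddd": [1.5, None, None, 2.0],
    }
)

completeness = field_completeness(sample)
print(completeness)
```

A column at 100 has no missing values; anything lower shows the share of rows where that field is populated.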

Unique Products Missing DDD
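
One way this check could be sketched in pandas is shown below. It assumes a product counts as missing a DDD only when none of its rows carries a DDD value; the column names (`vmp_code`, `ddd`) and the sample data are hypothetical and would need to be mapped to the real dataset.

```python
import pandas as pd


def products_missing_ddd(
    df: pd.DataFrame, product_col: str = "vmp_code", ddd_col: str = "ddd"
) -> list:
    """Return the sorted product codes for which no row has a DDD value."""
    # For each product, True if at least one row has a non-null DDD.
    has_ddd = df[ddd_col].notna().groupby(df[product_col]).any()
    return has_ddd[~has_ddd].index.tolist()


# Hypothetical sample: product B never has a DDD, product A has one in some rows.
sample = pd.DataFrame(
    {
        "vmp_code": ["A", "A", "B", "B", "C"],
        "ddd": [1.0, None, None, None, 2.0],
    }
)

missing = products_missing_ddd(sample)
print(len(missing), "unique product(s) missing DDD:", missing)
```

If the stricter definition is wanted (a product is flagged when any of its rows lacks a DDD), `.any()` can be swapped for `.all()`.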

Number of Hospitals per Month
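
The monthly hospital count could be sketched as follows, assuming one row per hospital, product, and month, with hospitals identified by a code column. The names `year_month` and `ods_code` and the sample rows are hypothetical. The second helper lists hospitals that report in every month of the data, which is one possible reading of "appear every month".

```python
import pandas as pd


def hospitals_per_month(
    df: pd.DataFrame, month_col: str = "year_month", hospital_col: str = "ods_code"
) -> pd.Series:
    """Count the distinct hospitals that appear in each month."""
    return df.groupby(month_col)[hospital_col].nunique()


def hospitals_in_every_month(
    df: pd.DataFrame, month_col: str = "year_month", hospital_col: str = "ods_code"
) -> list:
    """Return the hospitals present in every month found in the data."""
    n_months = df[month_col].nunique()
    months_seen = df.groupby(hospital_col)[month_col].nunique()
    return sorted(months_seen[months_seen == n_months].index)


# Hypothetical sample: RX1 reports in both months, RX2 and RX3 in one month each.
sample = pd.DataFrame(
    {
        "year_month": ["2021-01", "2021-01", "2021-01", "2021-02", "2021-02"],
        "ods_code": ["RX1", "RX2", "RX1", "RX1", "RX3"],
    }
)

monthly_counts = hospitals_per_month(sample)
always_present = hospitals_in_every_month(sample)
print(monthly_counts)
print("Present in every month:", always_present)
```

A sudden drop in the monthly count, or a short list of hospitals present throughout, would suggest gaps in submission rather than real changes in prescribing.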